162 research outputs found

New decoding algorithms for Hidden Markov Models using distance measures on labellings

Author: A Krogh
B Brejová
Daniel G Brown
ELL Sonnhammer
GE Tusnady
Jakub Truszkowski
L Käll
L Käll
M Stanke
P Fariselli
R Durbin
SL Cawley
Publication venue: BioMed Central
Publication date: 01/01/2010

Background: Existing hidden Markov model decoding algorithms do not focus on approximately identifying the sequence feature boundaries. Results: We give a set of algorithms to compute the conditional probability of all labellings "near" a reference labelling λ for a sequence y for a variety of definitions of "near". In addition, we give optimization algorithms to find the best labelling for a sequence in the robust sense of having all of its feature boundaries nearly correct. Natural problems in this domain are NP-hard to optimize. For membrane proteins, our algorithms find the approximate topology of such proteins with comparable success to existing programs, while being substantially more accurate in estimating the positions of transmembrane helix boundaries. Conclusion: More robust HMM decoding may allow for better analysis of sequence features, in reasonable runtimes.
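
To make the idea of a labelling being "near" a reference concrete, the following is a minimal sketch of one possible definition: two labellings are near if they have the same sequence of segment labels and every boundary is shifted by at most d positions. The function names, the label alphabet (i, M, o) and this particular definition are assumptions made for illustration; the abstract does not say which definitions the paper uses.

```python
from itertools import groupby
from typing import List


def boundaries(labelling: str) -> List[int]:
    """Return every position i where the label at i differs from the label at i - 1."""
    return [i for i in range(1, len(labelling)) if labelling[i] != labelling[i - 1]]


def segment_labels(labelling: str) -> List[str]:
    """Collapse runs of identical labels, e.g. 'iiMMoo' -> ['i', 'M', 'o']."""
    return [label for label, _ in groupby(labelling)]


def is_near(candidate: str, reference: str, d: int) -> bool:
    """Hypothetical 'near' test: same segment order, each boundary moved by at most d."""
    if len(candidate) != len(reference):
        return False
    if segment_labels(candidate) != segment_labels(reference):
        return False
    # Equal segment lists imply equal boundary counts, so zip pairs them one to one.
    return all(
        abs(c - r) <= d
        for c, r in zip(boundaries(candidate), boundaries(reference))
    )
```

With d = 0 the test reduces to exact equality of the two labellings; larger values of d tolerate small boundary shifts.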

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

A Combination of Compositional Index and Genetic Algorithm for Predicting Transmembrane Helical Segments

Author: A Krogh
A Thomas
B Rost
E Falkenauer
E Wallin
EL Sonnhammer
F Tekaia
G Tusnady
G von Heijne
GE Tusnady
H Berman
H Shen
H Zhou
J Holland
J Pylouster
JM Cuthbertson
L Kall
M Cserzo
M Suyama
MG Claros
Nazar Zaki
Pierandrea Temussi
R Garey
RY Kahsay
S Hosseini
S Jayasinghe
S Roy
Salah Bouktif
Sanja Lazarova-Molnar
T Hirokawa
T Nugent
T Taylor
Publication venue: Public Library of Science
Publication date: 01/01/2011

Transmembrane helix (TMH) topology prediction is becoming a focal problem in bioinformatics because the structure of TM proteins is difficult to determine using experimental methods. Therefore, methods that can computationally predict the topology of helical membrane proteins are highly desirable. In this paper we introduce TMHindex, a method for detecting TMH segments using only the amino acid sequence information. Each amino acid in a protein sequence is represented by a Compositional Index, which is deduced from a combination of the difference in amino acid occurrences in TMH and non-TMH segments in training protein sequences and the amino acid composition information. Furthermore, a genetic algorithm was employed to find the optimal threshold value for the separation of TMH segments from non-TMH segments. The method successfully predicted 376 out of the 378 TMH segments in a dataset consisting of 70 test protein sequences. The sensitivity and specificity for classifying each amino acid in every protein sequence in the dataset were 0.901 and 0.865, respectively. To assess the generality of TMHindex, we also tested the approach on another standard 73-protein 3D helix dataset. TMHindex correctly predicted 91.8% of proteins based on TM segments. The level of accuracy achieved using TMHindex in comparison to other recent approaches for predicting the topology of TM proteins is a strong argument in favor of our proposed method. Availability: The datasets and software, together with supplementary materials, are available at: http://faculty.uaeu.ac.ae/nzaki/TMHindex.htm
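
As a rough illustration of thresholding a per-residue score and evaluating the result, the sketch below labels residues as TMH when their score reaches a threshold and computes per-residue sensitivity and specificity. The scores are made-up numbers, not real Compositional Index values, and the threshold is chosen by a plain exhaustive search over the observed scores rather than the genetic algorithm the abstract describes; all function names are hypothetical.

```python
from typing import List, Sequence, Tuple


def predict(scores: Sequence[float], threshold: float) -> List[bool]:
    """Label a residue as TMH (True) when its score reaches the threshold."""
    return [s >= threshold for s in scores]


def sensitivity_specificity(pred: Sequence[bool], truth: Sequence[bool]) -> Tuple[float, float]:
    """Per-residue sensitivity TP/(TP+FN) and specificity TN/(TN+FP).

    Assumes truth contains at least one TMH and one non-TMH residue.
    """
    tp = sum(p and t for p, t in zip(pred, truth))
    fn = sum((not p) and t for p, t in zip(pred, truth))
    tn = sum((not p) and (not t) for p, t in zip(pred, truth))
    fp = sum(p and (not t) for p, t in zip(pred, truth))
    return tp / (tp + fn), tn / (tn + fp)


def best_threshold(scores: Sequence[float], truth: Sequence[bool]) -> float:
    """Exhaustive search (a stand-in for the genetic algorithm) over observed scores,
    maximising sensitivity + specificity."""
    return max(
        sorted(set(scores)),
        key=lambda t: sum(sensitivity_specificity(predict(scores, t), truth)),
    )


# Toy data: five residues, the first two truly in a TMH segment.
scores = [0.9, 0.8, 0.2, 0.1, 0.7]
truth = [True, True, False, False, False]
sens, spec = sensitivity_specificity(predict(scores, 0.5), truth)
```

A genetic algorithm becomes attractive when the search space is too large to enumerate; for a single scalar threshold on a small dataset the exhaustive search above is enough to show the objective being optimised.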

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

University of Southern Denmark Research Output

Identification of Giardia lamblia DHHC Proteins and the Role of Protein S-palmitoylation in the Encystation Process

Author: A Criscuolo
A Hiltpold
A Krogh
AB Hehl
AF Roth
AF Roth
AI Magee
Andrea S. Rópolo
B Eisenhaber
BC Jennings
BJ Davids
BJ Davids
BR Martin
BT Emmer
C Aicart-Ramos
C Aurrecoechea
C Faso
C Notredame
C Salaun
Cecilia V. Vranych
CH Sun
CH Sun
CH Wang
D Lloyd
DA Mitchell
DA Mitchell
DB Keister
DJ Bartels
DS Reiner
DS Reiner
EG Politis
F Abascal
FD Gillin
G DeJesus
GE Tusnady
GE Tusnady
HD Lujan
HD Lujan
HG Elmendorf
HG Morrison
I Letunic
I Letunic
I Slavin
J Greaves
J Greaves
J Greaves
J Pei
J Ren
J Schindelin
J Schultz
J Wan
J Yee
JA Duncan
JE Smotrys
JO Ebinu
JR Beck
JT Dunphy
K Frenal
KB Nicholas
KJ Livak
L Morf
LH Su
M Anisimova
M Fukata
M Punta
M Saric
María C. Merino
María C. Touz
MC Touz
MC Touz
MC Touz
MC Touz
MD Resh
ME Linder
ME Linder
MG De Napoli
ML Jones
ML Jones
MM Corvi
MM Zhang
MR Rivero
MS Brown
N Gottig
Nahuel Zamponi
O Batistic
O Rocks
P Papanastasiou
PG Carranza
R Tsutsumi
RD Adam
RD Finn
S Guindon
S Lobo
S Sonda
S Stoven
SE Boucher
SF Chuang
SL Planey
SM Singer
SR Birkeland
Steven M. Singer
T Hoppe
T Lauwaet
TE Nash
TE Nash
TN Petersen
TS Worgall
WF Leong
Y Fukata
Y He
Y Huang
Y Ohno
Y Webb
YC Huang
YJ Pan
Publication venue: Public Library of Science (PLoS)
Publication date: 01/07/2014

Protein S-palmitoylation, a hydrophobic post-translational modification, is performed by protein acyltransferases that have a common DHHC Cys-rich domain (DHHC proteins), and provides a regulatory switch for protein membrane association. In this work, we analyzed the presence of DHHC proteins in the protozoan parasite Giardia lamblia and the function of the reversible S-palmitoylation of proteins during parasite differentiation into cysts. Two specific events were observed: encysting cells displayed a larger amount of palmitoylated proteins, and parasites treated with palmitoylation inhibitors produced a reduced number of mature cysts. With bioinformatics tools, we found nine DHHC proteins, potential protein acyltransferases, in the Giardia proteome. These proteins displayed a conserved structure when compared to different organisms and are distributed in different monophyletic clades. Although all Giardia DHHC proteins were found to be present in trophozoites and encysting cells, these proteins showed a different intracellular localization in trophozoites and seemed to be differently involved in the encystation process when they were overexpressed. dhhc transgenic parasites showed a different pattern of cyst wall protein expression and yielded different amounts of mature cysts when they were induced to encyst. Our findings disclosed some important issues regarding the role of DHHC proteins and palmitoylation during Giardia encystation.

Affiliation: Merino, Maria Cecilia. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Córdoba. Instituto de Investigación Médica Mercedes y Martín Ferreyra. Universidad Nacional de Córdoba. Instituto de Investigación Médica Mercedes y Martín Ferreyra; Argentina

Affiliation: Zamponi, Nahuel. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Córdoba. Instituto de Investigación Médica Mercedes y Martín Ferreyra. Universidad Nacional de Córdoba. Instituto de Investigación Médica Mercedes y Martín Ferreyra; Argentina

Affiliation: Vranych, Cecilia Verónica. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Córdoba. Instituto de Investigación Médica Mercedes y Martín Ferreyra. Universidad Nacional de Córdoba. Instituto de Investigación Médica Mercedes y Martín Ferreyra; Argentina

Affiliation: Touz, Maria Carolina. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Córdoba. Instituto de Investigación Médica Mercedes y Martín Ferreyra. Universidad Nacional de Córdoba. Instituto de Investigación Médica Mercedes y Martín Ferreyra; Argentina

Affiliation: Ropolo, Andrea Silvana. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Córdoba. Instituto de Investigación Médica Mercedes y Martín Ferreyra. Universidad Nacional de Córdoba. Instituto de Investigación Médica Mercedes y Martín Ferreyra; Argentin

Crossref

CONICET Digital

Directory of Open Access Journals

PubMed Central

FigShare

Mechanistic Insights into a Novel Exporter-Importer System of Mycobacterium tuberculosis Unravel Its Role in Trafficking of Iron

Author: A Bairoch
A Stintzi
Aisha Farhana
Anil K. Tyagi
B Miroux
B Schwyn
C Ratledge
C Ratledge
CH Fiske
D Kaushal
D Levy
Dana Davis
ED Weinberg
F Hoegy
GE Tusnady
GE Tusnady
GM Rodriguez
GM Rodriguez
GM Rodriguez
J Gobin
J Sambrook
JB Neilands
JE Walker
JH Crosa
JH Crosa
JJ De Voss
JL Furrer
L Cronje
L Rindi
M Braibant
N Dhar
Nasreen Z. Ehtesham
P Kuzmic
P Prakash
P Prakash
Prahlad C. Ghosh
R Krithika
R Tam
RD Finn
S Banerjee
S Schubert
S Tundup
Sandeep Kumar
Seyed E. Hasnain
SF Altschul
Shailendra S. Rathore
T Parish
TJ Brickman
UE Schaible
W Zhu
WR Beisel
XZ Li
Publication venue: Public Library of Science
Publication date: 01/05/2008

Elucidation of the basic mechanistic and biochemical principles underlying siderophore-mediated iron uptake in mycobacteria is crucial for targeting this principal survival strategy vis-à-vis virulence determinants of the pathogen. Although siderophore biosynthesis is understood, the mechanism of siderophore secretion and uptake remains elusive. Here, we demonstrate an interplay among three iron-regulated Mycobacterium tuberculosis (M.tb) proteins, namely Rv1348 (IrtA), Rv1349 (IrtB) and Rv2895c, in the export and import of M.tb siderophores across the membrane and the consequent iron uptake. IrtA, interestingly, has a fused N-terminal substrate binding domain (SBD), representing an atypical subset of ABC transporters, unlike IrtB, which harbors only the permease and ATPase domain. SBD selectively binds to non-ferrated siderophores whereas Rv2895c exhibits relatively higher affinity towards ferrated siderophores. An interaction between the permease domain of IrtB and Rv2895c is evident from GST pull-down assay. In vitro liposome reconstitution experiments further demonstrate that IrtA is indeed a siderophore exporter and the two-component IrtB-Rv2895c system is an importer of ferrated siderophores. Knockout of msmeg_6554, the irtA homologue in Mycobacterium smegmatis, resulted in an impaired M.tb siderophore export that is restored upon complementation with M.tb irtA. Our data suggest the interplay of three proteins, namely IrtA, IrtB and Rv2895c, in synergizing the balance of siderophores and thus iron inside the mycobacterial cell.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Systematic search for putative new domain families in Mycoplasma gallisepticum genome

Author: A Lupas
A Marchler-Bauer
AG Murzin
B Rost
Bernard Offmann
CA Orengo
CC Reddy
CC Reddy
Chilamakuri CS Reddy
CS Reddy
EL Sonnhammer
GE Tusnady
J Park
JD Thompson
K Tamura
L Papazisi
LJ McGuffin
N Saitou
NC Kyrpides
R Sowdhamini
R Sowdhamini
S Dietmann
Sane Sudha Rani
SF Altschul
SR Eddy
T Nakatsu
Publication venue: BioMed Central
Publication date: 01/01/2010

Background: Protein domains are the fundamental units of protein structure, function and evolution. The delineation of different domains in proteins is important for classification, understanding of structure, function and evolution. The delineation of protein domains within a polypeptide chain, namely at the genome scale, can be achieved in several ways but may remain problematic in many instances. Difficulties in identifying the domain content of a given sequence arise when the query sequence has no homologues with experimentally determined structure and searching against sequence domain databases also results in insignificant matches. Identification of domains under low sequence identity conditions and lack of structural homologues acquires crucial importance, especially at the genomic scale. Findings: We have developed a new method for the identification of domains in unassigned regions through indirect connections and scaled up its application to the analysis of 434 unassigned regions in 726 protein sequences of the Mycoplasma gallisepticum genome. We could establish 71 new domain relationships and 63 probable new domain families through intermediate sequences in the unassigned regions, which importantly represent an overall 10% increase in PfamA domain annotation over the direct assignment in this genome. Conclusions: The systematic analysis of the unassigned regions in the Mycoplasma gallisepticum genome has provided some insight into the possible new domain relationships and putative new domain families. Further investigation of these predicted new domains may prove beneficial in improving the existing domain prediction algorithms.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Hal-Diderot

Functional discrimination of membrane proteins using machine learning techniques

Author: AG Garrow
B Rost
D Fu
DP Chimento
DP Chimento
EL Borths
G von Heijne
GE Tusnady
IH Witten
J Abramson
M Michael Gromiha
MH Saier Jr
MH Saier Jr
MM Gromiha
MM Gromiha
MM Gromiha
MM Gromiha
NK Natt
PG Bagos
PL Martelli
Q Ren
R Dutzler
S Murakami
SF Altschul
T Hirokawa
T Nogi
Y Huang
YD Cai
YH Taguchi
Yukimitsu Yabuki
Publication venue: BioMed Central
Publication date: 01/01/2008

Background: Discriminating membrane proteins based on their functions is an important task in genome annotation. In this work, we have analyzed the characteristic features of amino acid residues in membrane proteins that perform major functions, such as channels/pores, electrochemical potential-driven transporters and primary active transporters. Results: We observed that the residues Asp, Asn and Tyr are dominant in channels/pores whereas the composition of hydrophobic residues, Phe, Gly, Ile, Leu and Val, is high in electrochemical potential-driven transporters. The composition of all the amino acids in primary active transporters lies in between the other two classes of proteins. We have utilized different machine learning algorithms, such as Bayes rule, logistic function, neural network, support vector machine and decision tree, for discriminating these classes of proteins. We observed that most of the algorithms discriminated them with similar accuracy. The neural network method discriminated the channels/pores, electrochemical potential-driven transporters and active transporters with a 5-fold cross-validation accuracy of 64% in a data set of 1718 membrane proteins. The application of amino acid occurrence improved the overall accuracy to 68%. In addition, we have discriminated transporters from other α-helical and β-barrel membrane proteins with an accuracy of 85% using the k-nearest neighbor method. The classification of transporters and all other proteins (globular and membrane) showed an accuracy of 82%. Conclusion: The performance of discrimination with amino acid occurrence is better than that with amino acid composition. We suggest that this method could be effectively used to discriminate transporters from all other globular and membrane proteins, and classify them into channels/pores, electrochemical and active transporters.
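
The two sequence-derived feature types named in the abstract can be sketched as follows. This is an illustration under stated assumptions: "occurrence" is taken here to mean the raw count of each of the 20 standard amino acids and "composition" the same counts divided by their total; the abstract does not define either term, and the function names are hypothetical.

```python
from typing import Dict

# One-letter codes of the 20 standard amino acids.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def occurrence(sequence: str) -> Dict[str, int]:
    """Raw count of each standard amino acid (assumed meaning of 'occurrence')."""
    seq = sequence.upper()
    return {aa: seq.count(aa) for aa in AMINO_ACIDS}


def composition(sequence: str) -> Dict[str, float]:
    """Counts normalised by the number of standard residues (assumed meaning of 'composition')."""
    counts = occurrence(sequence)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    return {aa: n / total for aa, n in counts.items()}
```

Either dictionary can be flattened into a 20-element feature vector, in the fixed order of AMINO_ACIDS, before being passed to a classifier.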

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Mathematical model for empirically optimizing large scale production of soluble protein domains

Author: A Fontana
A Kouranov
Atsushi Kurotani
BH Dessailly
C Zhang
D Christ
DT Jones
E Chikayama
Eisuke Chikayama
F Corpet
GE Folkers
GE Tusnady
HM Berman
JM Chandonia
M Dumontier
M Suyama
PB Card
R Kikuno
RL Marsden
S Cabantous
S Dokudovskaya
S Miyazaki
S Miyazaki
Satoshi Miyazaki
SF Altschul
Shigeyuki Yokoyama
SJ Wheelan
T Hondoh
T Kigawa
T Niwa
T Tanaka
Takanori Tanaka
Takashi Yabuki
TC Terwilliger
X Gao
Y Kuroda
Yutaka Kuroda
Publication venue: Springer Science and Business Media LLC

Crossref

Transmembrane protein topology prediction using support vector machines

Author: A Krogh
A Kyttälä
A Lo
B Boeckmann
B Rost
BW Matthews
C Pasquier
CP Chen
D Perlman
DA Benson
David T Jones
DT Jones
DT Jones
E Granseth
E Wallin
E Wong
G Gafvelin
G Lasso
G von Heijne
G von Heijne
GE Tusnady
GE Tusnády
GE Tusnády
H Viklund
H Viklund
H Viklund
H Viklund
HM Berman
HR Petty
JD Bendtsen
JM Cuthbertson
K Melén
L Käll
L Käll
M Amico
M Hedman
MA Lomize
O Emanuelsson
P Flicek
PG Bagos
PG Bagos
PL Martelli
Q Mao
S Abe
S Jayasinghe
S Möller
S White
SF Altschul
T Hirokawa
T Joachims
TA Larsson
Timothy Nugent
V Vapnik
Y Chen
Y Liu
Y Liu
Publication venue: BioMed Central
Publication date: 01/01/2009

Background: Alpha-helical transmembrane (TM) proteins are involved in a wide range of important biological processes such as cell signaling, transport of membrane-impermeable molecules, cell-cell communication, cell recognition and cell adhesion. Many are also prime drug targets, and it has been estimated that more than half of all drugs currently on the market target membrane proteins. However, due to the experimental difficulties involved in obtaining high quality crystals, this class of protein is severely under-represented in structural databases. In the absence of structural data, sequence-based prediction methods allow TM protein topology to be investigated. Results: We present a support vector machine-based (SVM) TM protein topology predictor that integrates both signal peptide and re-entrant helix prediction, benchmarked with full cross-validation on a novel data set of 131 sequences with known crystal structures. The method achieves topology prediction accuracy of 89%, while signal peptides and re-entrant helices are predicted with 93% and 44% accuracy respectively. An additional SVM trained to discriminate between globular and TM proteins detected zero false positives, with a low false negative rate of 0.4%. We present the results of applying these tools to a number of complete genomes. Source code, data sets and a web server are freely available from http://bioinf.cs.ucl.ac.uk/psipred/. Conclusion: The high accuracy of TM topology prediction which includes detection of both signal peptides and re-entrant helices, combined with the ability to effectively discriminate between TM and globular proteins, make this method ideally suited to whole genome annotation of alpha-helical transmembrane proteins.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

UCL Discovery

Predicting protein-protein binding sites in membrane proteins

Author: A Elofsson
A Koike
A Liaw
AJ Bordner
AJ Bordner
AJ Bordner
AJ Bordner
Andrew J Bordner
B Wang
C Yan
D Lupo
E Krissinel
GE Tusnady
H Chen
H Neuvirth
HX Zhou
I Res
JR Bradford
L Breiman
L Feng
MA Yildirim
NJ Burgoyne
P Fariselli
R Development Core Team
R Landgraf
RC Edgar
S Hartel-Schenk
S Jones
S Jones
SA Eyers
SF Altschul
SH White
TM Bakheet
W Li
XW Chen
Y Ofran
Publication venue: BioMed Central
Publication date: 01/09/2009

Background: Many integral membrane proteins, like their non-membrane counterparts, form either transient or permanent multi-subunit complexes in order to carry out their biochemical function. Computational methods that provide structural details of these interactions are needed since, despite their importance, relatively few structures of membrane protein complexes are available. Results: We present a method for predicting which residues are in protein-protein binding sites within the transmembrane regions of membrane proteins. The method uses a Random Forest classifier trained on residue type distributions and evolutionary conservation for individual surface residues, followed by spatial averaging of the residue scores. The prediction accuracy achieved for membrane proteins is comparable to that for non-membrane proteins. Also, like previous results for non-membrane proteins, the accuracy is significantly higher for residues distant from the binding site boundary. Furthermore, a predictor trained on non-membrane proteins was found to yield poor accuracy on membrane proteins, as expected from the different distribution of surface residue types between the two classes of proteins. Thus, although the same procedure can be used to predict binding sites in membrane and non-membrane proteins, separate predictors trained on each class of proteins are required. Finally, the contribution of each residue property to the overall prediction accuracy is analyzed and prediction examples are discussed. Conclusion: Given a membrane protein structure and a multiple alignment of related sequences, the presented method gives a prioritized list of which surface residues participate in intramembrane protein-protein interactions. The method has potential applications in guiding the experimental verification of membrane protein interactions, structure-based drug discovery, and also in constraining the search space for computational methods, such as protein docking or threading, that predict membrane protein complex structures.
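
The spatial averaging step mentioned in the abstract can be sketched as below: each residue's score is replaced by the mean score of all residues within a fixed distance of it. The radius, the use of one 3D coordinate per residue and the unweighted mean are assumptions made for illustration; the abstract does not specify the averaging scheme, and the per-residue scores would come from a classifier that is not shown here.

```python
import math
from typing import List, Sequence, Tuple

Point = Tuple[float, float, float]


def spatially_average(
    coords: Sequence[Point], scores: Sequence[float], radius: float
) -> List[float]:
    """Replace each score with the unweighted mean over residues within `radius`.

    Every residue lies within distance 0 of itself, so each neighbourhood is non-empty.
    """
    if len(coords) != len(scores):
        raise ValueError("coords and scores must have the same length")
    averaged = []
    for centre in coords:
        nearby = [
            score
            for point, score in zip(coords, scores)
            if math.dist(centre, point) <= radius
        ]
        averaged.append(sum(nearby) / len(nearby))
    return averaged


# Toy example: two adjacent residues and one isolated residue (arbitrary units).
coords = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (10.0, 0.0, 0.0)]
scores = [1.0, 0.0, 0.5]
smoothed = spatially_average(coords, scores, radius=2.0)
```

Averaging over neighbours smooths isolated high or low scores, which fits the intuition that binding-site residues cluster on the protein surface.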

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Differential effects of human and plant N-acetylglucosaminyltransferase I (GnTI) in plants

Author: A Schaewen von
Alexander van der Krol
Bas Heinhuis
C Saint-Jore-Dupas
CS Lisenbee
Dirk Bosch
E Grabenhorst
F Brandizzi
FA Engelen van
FG Masclaux
GE Tusnady
H Bakker
H Puthalakath
H Schachter
I Wenderoth
J Aker
J Burke
J Helenius
Jan Willem Borst
Jochem Eigenhuijsen
K Shah
KJ Livak
L Gomez
M Grebe
M Sarkar
Mariëlle Schreuder
Maurice Henquet
MD Snider
NQ Palacpac
R Kornfeld
R Kumar
R Strasser
R Strasser
S Pownall
S Yoshida
SJ Clough
T Fukada
U Neumann
Publication venue: Springer Netherlands
Publication date: 01/01/2009

In plants and animals, the first step in complex type N-glycan formation on glycoproteins is catalyzed by N-acetylglucosaminyltransferase I (GnTI). We show that the cgl1-1 mutant of Arabidopsis, which lacks GnTI activity, is fully complemented by YFP-labeled plant AtGnTI, but only partially complemented by YFP-labeled human HuGnTI, and that this is due to post-transcriptional events. In contrast to AtGnTI-YFP, only low levels of HuGnTI-YFP protein were detected in transgenic plants. In protoplast co-transfection experiments all GnTI-YFP fusion proteins co-localized with a Golgi marker protein, but only limited co-localization of AtGnTI and HuGnTI was observed in the same plant protoplast. The partial alternative targeting of HuGnTI in plant protoplasts was alleviated by exchanging the membrane-anchor domain with that of AtGnTI, but in stably transformed cgl1-1 plants this chimeric GnTI still did not lead to full complementation of the cgl1-1 phenotype. Combined, the results indicate that activity of HuGnTI in plants is limited by a combination of reduced protein stability, alternative protein targeting and possibly, to some extent, lower enzymatic performance of the catalytic domain in the plant biochemical environment.

Crossref

Springer - Publisher Connector

PubMed Central

Utrecht University Repository